Words containing ,:

Rank in Wordlist | Frequency | Word |
---|---|---|
8399 | 59 | 1,5 |
11578 | 40 | 2,5 |
16305 | 26 | 3,5 |
24505 | 15 | 4,5 |
28616 | 12 | %, |
30454 | 11 | 5,5 |
30457 | 11 | 6,5 |
37695 | 8 | 1,7 |
41197 | 7 | 3,7 |
41206 | 7 | 4,4 |

Words containing (:

Rank in Wordlist | Frequency | Word |
---|---|---|
59033 | 4 | 8(21 |
62188 | 4 | ВКП(б |
70370 | 3 | 8(843 |
99803 | 2 | Гайнетдин(с |
103170 | 2 | РКП(б)ның |
119303 | 2 | ук(ыт)у |
124879 | 1 | 11(24 |
125244 | 1 | 15(28 |
125245 | 1 | 15(28)мартның |
125364 | 1 | 16(1 |

Words containing ):

Rank in Wordlist | Frequency | Word |
---|---|---|
69955 | 3 | 1)кырымтатарча |
87849 | 2 | %) |
88373 | 2 | 2)кырымтатарча |
103170 | 2 | РКП(б)ның |
104625 | 2 | Төркия)кырымтатарча |
104900 | 2 | Финляндия)кырымтатарча |
112141 | 2 | киңәшчесе)парламентка |
119303 | 2 | ук(ыт)у |
124437 | 1 | %)* |
124438 | 1 | %). |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
10126 | 47 | 100% |
10308 | 46 | 30% |
10488 | 45 | 50% |
10695 | 44 | 10% |
10901 | 43 | 80% |
11337 | 41 | 90% |
11579 | 40 | 70% |
14183 | 31 | 40% |
14553 | 30 | 20% |
15373 | 28 | 25% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
42470 | 7 | Гүзәлия&Радик |
59198 | 4 | Ell&Niki |
71304 | 3 | T&B |
89994 | 2 | Marks&Spencer |
129384 | 1 | Belem&IT |
131439 | 1 | H&M |
134168 | 1 | Oscar&c7c5 |
135468 | 1 | S&P |
145201 | 1 | hub&spoke |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
69877 | 3 | $185,000 |
87845 | 2 | $1 |
87846 | 2 | $1,5 |
87847 | 2 | $1.4 |
87848 | 2 | $9 |
88728 | 2 | 60$ |
124403 | 1 | $1,8 |
124404 | 1 | $1.50 |
124405 | 1 | $10 |
124406 | 1 | $10-50 |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
6866 | 75 | mäs'älä |
9024 | 54 | Ul'yan |
9191 | 53 | mäs'älälär |
10927 | 43 | tä'min |
11610 | 40 | tä'sir |
18582 | 22 | festival'dä |
18600 | 22 | mäs'älär |
21602 | 18 | festival'neñ |
22544 | 17 | fol'klor |
22578 | 17 | mäs''älä |

Words containing *:

Rank in Wordlist | Frequency | Word |
---|---|---|
124437 | 1 | %)* |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
23382 | 16 | 1/8 |
35517 | 9 | Азатлык/Азат |
45597 | 6 | 9/11 |
51155 | 5 | 1/3 |
51156 | 5 | 1/5 |
51258 | 5 | 2/3 |
69963 | 3 | 1/16 |
75911 | 3 | Европа/Азатлык |
87986 | 2 | 1/4 |
88687 | 2 | 50/50 |

In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others will not be. If words containing forbidden characters do not have a very low frequency, there may be a problem in preprocessing.
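The check described here can be sketched in a few lines of Python. This is only an illustration, not the procedure used to produce this report: the function name `flag_suspicious`, the `allowed` set, and the cut-off `min_freq=10` are assumptions chosen for the example, and the sample rows are taken from the tables in this subsection.

```python
from collections import defaultdict

# Special characters examined in this subsection.
SPECIAL_CHARS = ",()%&$\"'+*=/_"


def flag_suspicious(words, allowed=frozenset(), min_freq=10):
    """Group words that contain a forbidden special character.

    words    -- iterable of (word, freq) pairs
    allowed  -- characters treated as legitimate inside words (assumption)
    min_freq -- hypothetical cut-off; rarer hits are treated as noise
    """
    flagged = defaultdict(list)
    for word, freq in words:
        if freq < min_freq:
            continue  # very low frequency: not worth reporting
        for ch in SPECIAL_CHARS:
            if ch not in allowed and ch in word:
                flagged[ch].append((word, freq))
    return dict(flagged)


# Sample rows copied from the tables of this subsection.
sample = [("1,5", 59), ("100%", 47), ("mäs'älä", 75), ("H&M", 1)]

# Assumption for the example: the apostrophe is allowed within words.
result = flag_suspicious(sample, allowed={"'"}, min_freq=10)
print(result)
```

With these settings only `1,5` and `100%` are reported: `mäs'älä` passes because the apostrophe is declared allowed, and `H&M` falls below the cut-off. Which characters are allowed, and where the cut-off lies, depends on the language and on the tokenizer.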
Query used for words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
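The same query can be run for every character in the list, provided the two `LIKE` wildcards `%` and `_` are escaped. The sketch below uses Python's `sqlite3` module so that it is self-contained. It is an assumption-laden illustration: the column layout of `words`, the helper names `like_pattern` and `words_containing`, and the added `ORDER BY w_id` are not taken from this report. The query above appears to rely on a backslash being the default escape character, whereas SQLite needs an explicit `ESCAPE` clause.

```python
import sqlite3

# Special characters examined in this subsection.
SPECIAL_CHARS = ",()%&$\"'+*=/_"


def like_pattern(ch):
    """Build a LIKE pattern matching words that contain ch literally."""
    # % and _ are LIKE wildcards; the escape character must be escaped too.
    if ch in "%_\\":
        ch = "\\" + ch
    return "%" + ch + "%"


def words_containing(conn, ch, limit=10):
    """Return (rank, freq, word) rows for words containing ch."""
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"  # ORDER BY added here for a stable result
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


# Hypothetical miniature of the words table; rows are taken from the
# tables of this subsection, with w_id = rank + 100 as the query implies.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [(8499, "1,5", 59), (10226, "100%", 47), (10408, "30%", 46), (131539, "H&M", 1)],
)

percent_rows = words_containing(conn, "%")
underscore_rows = words_containing(conn, "_")
for ch in SPECIAL_CHARS:
    print(repr(ch), words_containing(conn, ch))
```

Without the escape, the pattern for `_` would be `%_%`, which matches every non-empty word rather than only words containing an underscore.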
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots